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Abstract 

We continue the study of welfare maximization in unit-demand (matching) markets, in a distributed 
information model where agent’s valuations are unknown to the central planner, and therefore commu¬ 
nication is required to determine an efficient allocation. Dobzinski, Nisan and Oren (STOC’14) showed 
that if the market size is n, then r rounds of interaction (with logarithmic bandwidth) suffice to obtain 
an n^/*'’'"'"^rapproximation to the optimal social welfare. In particular, this implies that such markets 
converge to a stable state (constant approximation) in time logarithmic in the market size. 

We obtain the first multi-round lower bound for this setup. We show that even if the allowable per- 
round bandwidth of each agent is the approximation ratio of any r-round (randomized) protocol 

is no better than ), implying an f2(loglogn) lower bound on the rate of convergence of the 

market to equilibrium. 

Our construction and technique may be of interest to round-communication tradeoffs in the more 
general setting of combinatorial auctions, for which the only known lower bound is for simultaneous 
(r = 1) protocols | DN014 |. 
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1 Introduction 


This paper studies the tradeoff between the amount of communication and the number of rounds of inter¬ 
action required to find an (approximately) optimal matching in a bipartite graph. In our model there are n 
“players” and m “items”. Each player initially knows a subset of the items to which it may be matched (i.e. 
m bits of information). The players communicate in rounds: in each round each player writes a message on 
a shared blackboard. The message can only depend on what the player knows at that stage: his initial input 
and all the messages by all other players that were written on the blackboard in previous rounds. 

This problem was recently introduced by |DN014] as a simple market scenario: the players are unit- 
demand bidders and our goal is to find an (approximately) welfare-maximizing allocation of the items to 
players. The classic auction of |DGS86) - that may be viewed as a simple Walrasian-like market process 
for this setting - can be implemented as to find an approximately optimal allocation where each player 
needs only send O(logn) bits of communication (on the average). The question considered by [DN014] was 
whether such a low communication burden suffices without using multiple rounds of interaction. As a lower 
bound, they proved that a non-interactive protocol, i.e. one that uses a single round of communication, 
cannot get a n^/^“'^-factor approximation (for any fixed e > 0) with bits of communication per player. 
As upper bounds they exhibited (I) an 0(log n)-round protocol, where each player sends O(logn) bits per 
round, that gets a j^-factor approximation (for any fixed 5 > 0) and (II) for any fixed r > 1, an r-round 
protocol, where each player sends O(logn) bits per round, that gets an 0(n^'^*^’'“''^))-approximation. 

The natural question at this point is whether there are r-round protocols with better approximation 
factors that still use bits of communication per player. This question was left open in [DNOI4j . where 
it was pointed out that it was even open whether the exactly optimal matching can be found by 2-round 
protocols that use 0(log n) bits of communication per player. We answer this open problem by proving lower 
bounds for any fixed number of rounds. 

Theorem: For every r > 1 there exists e(r) = exp(—r), such that every (deterministic or randomized) 
r-round protocol requires bits of communication per player in order to find a matching whose size is at 
least fraction of the optimal matching. 


Our Techniques. We construct a recursive family of hard distributions for every fixed number of com¬ 
munication rounds, and use information theoretic machinery to analyze it. Our proof uses a type of direct- 
sum based round-reduction argument for multiparty communication complexity. Unlike standard round- 
elimination arguments in the two-party model, our instance size (and thus the number of players) scales 
with the number of allowable rounds, and therefore eliminating a communication round essentially requires 
embedding a “low dimensional” instance (with fewer players) into a “higher dimensional” protocol (oper¬ 
ating over a larger input), from its second round onwards. In order to carry out such an embedding, we 
need a way of sampling the rest of the inputs to the higher dimensional protocol (including the remaining 
players) conditioned on the first message of the protocol, with no extra communication. The main obstacle 
is that conditioning on the first message of the “high dimensional” protocol correlates the private inputs of 
the players (i.e., the inputs to the “lower dimensional” protocol) with the “missing” inputs, and it is not 
hard to see that, in general, such sampling cannot be done without communication! Circumventing this 
major obstacle calls for a subtle construction and analysis, which ensures the aforementioned correlations 
remain “local” and therefore allows to perform the embedding using a combination of private and public 
randomness. Our constructed family of distributions is designed to facilitate such embedding (using certain 
conditional independence properties) on one hand, and yet retain a “marginal indistinguishability” property 
which is essential to keep the information argument above valid (we discuss this further in Section |4|). 

1.1 More context and related models 

The bipartite matching problem is clearly a very basic one and obviously models a host of situations beyond 
the economic one that was the direct motivation of |DN014] and of this paper. Despite having been widely 
studied, even its algorithmic status is not well understood, and it is not clear whether a nearly-linear time 
algorithm exists for it. (The best known running time (for the dense case) is the 40-year old 












algorithm of |HK73) . but for special cases like regular or near-regular graphs nearly linear times are known 
(e.g. |Alo03[ lYuslS) ')'). In parallel computation, a major open problem is whether bipartite matching can be 
solved in deterministic parallel poly-logarithmic time with a polynomial number of processors (Randomized 
parallel algorithms for the problem [MVV871IKUW85] have been known for over 25 years). It was suggested 
in |DN014] that studying the problem in the communication complexity model is an approach that might 
lead to algorithmic insights as well. 

The bipartite matching problem has been studied in various other multi-party models that focus on 
communication as well. In particular, strong and tight bounds for approximate matching are known in the 
weaker “message passing” or “private channels” models |HRVZ13] that have implications to models of parallel 
and distributed computation . Related work has also been done in networked distributed computing models, 
e.g., [LPSP08]. “One-way” communication models are used to analyze streaming or semi-streaming models 
and some upper bounds (e.g., |Kapl2| ) as well as weak lower bounds |GKK12| are known for approximate 
matchings in these models. For “r-way” protocols, a super-linear communication lower bound was recently 
shown by [G013) for exact matchings, in an incomparable mode0. A somewhat more detailed survey of 
these related models can be found in the appendix of |DN014) . 

It should be noted that the open problems mentioned above remain so even in the standard two-party 
setting where each of the two players holds all the information of n/2 of our players. We do not know any 
better upper bounds than what is possible in the multi-player model, and certainly, as the model is stronger, 
no better lower bounds are known. We also do not know whether our lower bound (or the single round one 
of IDNOldp applies also in this stronger two-player model. 

1.2 Open problems 

There are many open problems related to our work. Let us mention a few of the most natural ones. Our first 
open problem is closing the gap between our lower bound and the upper bound: We show that r = G(log log n) 
rounds of communication are required to achieve constant approximation ratio using poly-logarithmic bits 
per player, while the upper bound is r = O(logn). We believe that the upper bound is in fact tight, and 
improving the lower bound is left as our first and direct open problem. 

Another interesting direction is trying to extend our lower bound technique to obtain similar-in-spirit 
round-communication tradeoffs for the more general setup of combinatorial auctions, also studied by |DN014] . 
From a communication complexity perspective, lower bounds in this setup are more compelling, since player 
valuations require exponentially many bits to encode, hence interaction has the potential to reduce the over¬ 
all communication (required to obtain efficient allocations) from exponential to polynomial. Indeed, it is 
shown in [PNG 14] that, in the case of sub-additive bidders, there is an r-round randomized protocol that 
obtains an 0{r ■ m^/^’’“'"^^)-approximation to the optimal social welfare, where in each round each player 
sends poly{m,n) bits. Once again, an (exponential in m) lower bound on the communication was given only 
for the case of simultaneous protocols (r = 1) and the natural question is to extend it to multiple rounds as 
well. 

A more general open problem advocated by [DN014| is to analyze the communication complexity of 
finding an exact optimal matching. One may naturally conjecture that rounds of interaction are 

required for this if each player only sends bits in each round, but no super-logarithmic bound is known. 
The communication complexity of the problem without any limitation on the number of rounds is also open: 
no significantly super linear, tLi(nlogn), bound is known, while the best upper bound known is 


^Besides of the fact that the lower bound in |GQ13| applies only for testing exact matchings and not approximate matchings, 
their model consists of a small number of parties (constant or logarithmic in n) who are communicating in some fixed number 
of sequential rounds (not simultaneous). The input itself of each player is therefore super-linear in the number of nodes of the 
input graph (n), and indeed they prove a super-linear communication lower bound, which is clearly impossible in our model. 
The |GQ13| model was motivated by streaming lower bounds and does not seem to capture the economic scenario we attempt 
to model in this paper (i.e., that of private-valuations) and therefore this result is incomparable to ours, as also evidenced by 
the distinct proof-techniques. 
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2 preliminaries 

We reserve capital letters for random variables, and calligraphic letters for sets. The (statistical) distance 
between two distributions in the same probability space is denoted |/j, — ^| := i • |Ai(a) — We write 

X ± Y j Z to denote that X and Y are statistically independent conditioned on the random variable Z. 
For a vector random variable X = X1X2 ... Xg, we sometimes use the shorthands X<i and X-i to denote 

X1X2 ■ ■ ■ Xi and X1X2 ■ ■ ■ W-i, W+i). Xg respectively (similarly, := X^X^ ... X'‘~^X‘‘~^^ ... X^). 

We write A Gr U to denote a uniformly distributed random variable over the set K. We use the terms 
“bidders” and “players” interchangeably throughout the paper. 


2.1 Communication Model 

Our framework is the number-in-hand (NIH) multiparty communication complexity model with shared black¬ 
board. In this model, n players receive inputs {xi,X2, ■ ■ ■, Xn) G X1XX2X .. .Xn respectively. In our context, 
each of the n players (bidders) is associated with a node u G U = [n] ot some bipartite graph G = {U, V, E), 
and her input is the set of incident edges on her node (her demand set of items in 1^ = [m]). The players’ 
goal is to compute a maximum set of disjoint connected pairs (u, u) G E{G), i.e., a maximum matching in 
G (we define this formally below). 

The players communicate in some fixed number of rounds r, where in each communication round, players 
simultaneously write (at most) i bits each on a shared blackboard which is viewable to all parties. We 
sometimes refer to the parameter £ as the bandwidth of the protocol. In a deterministic protocol, each 
player’s message should be completely determined by the content of the blackboard and her own private 
input Xi- In a randomized protocol, the message of each player may further depend on both public and 
private random coins. When player’s inputs are distributional {{xi,X2, ■ ■ ■ ,Xn) /i) which is the setting in 
this paper, we may assume without loss of generality that the protocol is deterministic, since the averaging 
principle asserts that there is always some fixing of the randomness that will achieve the same performance 
with respect to /i. We remark that by the averaging principle, our main result applies to the randomized 
setting as welH. 

The transcript of a protocol tt (namely, the content of the blackboard) when executed on an input graph 
G is denoted by n(G'), or simply 11 when clear from context. At the end of the r’th communication round, a 
referee (the “central planner” in our context) computes a matching M (11), which is completely determined 
by n. We call this the output of the protocol. 

We will be interested in protocols that compute approximate matchings. To make this more formal, let 
Q{n,m) denote the family of bipartite graphs on (n,m)-vertex sets respectively, and denote by E(n,m) the 
family of all matchings in Q{n,m) (not necessarily maximum matchings). Denote by |A1(G)| the size of a 
maximum matching in the input graph G. We require that the output of any protocol satisfies Al(n) G 
The following definition is central to this work. 

Definition 2.1 (Approximate Matchings). We say that a protocol tt computes an a -approximate matching 
(a > 1) «/ |M(n) n A(G)| is at least ^•|A1(G)|, i.e., if the number of matched pairs {u,v) G E{G) is at least 
a (1 /a)-fraction of the maximum matching in G. Similarly, when the input graph G is distributed according 
to some distribution yt (i.e., {xi,X2, ■.. ,Xn) p-), we say that the approximation ratio of tt is a if 

E [|7W(n)nF;(G)|] >-• e [|ai(G)|]. 

G^fjs O. G'^fi 

The expected matching size of tt is E^[|M(n) fl E{G)\] (we remark that the “hard” distribution we con¬ 
struct in the next section will satisfy |AI(G)| = n for all G in the support of p, so the quantity EG--/i[l-^(^)l] 
will always be n). Note that these definitions in particular allow the protocol to be erroneous, i.e., the referee 
is allowed to output “illegal” pairs {u,v) ^ E{G), but we only count the correctly matched pairs. Our lower 
bound holds even with respect to this more permissive model. 


^More formally, if there is a distribution p on players inputs such that the approximation ratio of any r-round deterministic 
protocol with respect to p is at most a in expectation, then fixing the randomness of the protocol would yield a deterministic 
protocol with the same performance, thus the former lower bound applies to randomized r-round protocols as well. 
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2.2 Information theory 

Our proof relies on basic concepts from information theory. For a broader introduction to the field, and 
proofs of the claims below, we refer the reader to the excellent monograph of |CT91] . 

For two distributions ^ and v in the same probability space, the Kullback-Leiber divergence between /i 
and V is defined as 


D (^(a)||i^(a)) := E, 



M(a) 

v{a) 


( 1 ) 


The following well known inequality upper bounds the statistical distance between two distributions in 
terms of their KL Divergence: 

Lemma 2.2 (Pinsker’s inequality). For any two distributions fi and v, 


\n{a) - < ^ • D (Ai(a)lk(a)) ■ 

A related measure which is central to this paper is that of mutual information, which captures correlation 
between random variables. 


Definition 2.3 (Conditional Mutual Information). LetA,B,C be jointly distributed random variables. The 
Mutual Information between A and B conditioned on C is 

I{A-,B\C)-.= E ID) (Ai(a|6c)||/i(a|c)) = E D (/r(6|ac)||/r(6|c)) = /r(abc) log ^ 

Acb) ^,(ca) M(a|c) 

The above definition can be interpreted as follows: /(A; B\C) is large if the distribution (A|i? = b,C = c) 
is “far” from {A\C = c) for typical values of b,c, which means that B provides a lot of information about A 
conditioned on C. We note that an equivalent, more intuitive definition of (conditional) mutual information 
is I{A-,B\C) = F[{A\C) — F[{A\BC), where F[{A\C) is the (expected) Shannon Entropy of the random 
variable A conditioned on C. Thus A and B have large mutual information conditioned on C, if further 
conditioning on B significantly reduces the entropy of A. We prefer Definition 12.31 as it is more appropriate 
for our proof, but we note that the latter one immediately implies 

Fact 2.4. I{A-,C\D) < H{A\D) < H{A) < |A|, 

where the last term denotes the cardinality of log |Supp(A)| of the random variable A, and the second 
transition follows since conditioning never increases entropy. 

The most important property of mutual information is that it satisfies the following chain rule: 

Fact 2.5 (Chain rule for mutual information). Let A, B, C, D be jointly distributed random variables. Then 
I{AB- C\D) = I{A- C\D) + I{B- C\AD). 

Lemma 2.6 (Conditioning on independent variables increases information). Let A,B,C,D be jointly dis¬ 
tributed random variables. If I{A; D\C) = 0, then it holds that I[A',B\C) < I{A\B\CD). 

Proof. We apply the chain rule twice. On one hand, we have I{A-, BD\C) = I{A-,B\C) + I{A-,D\CB) > 
I{A-,B\C), since mutual information is nonnegative. On the other hand, I{A;BD\C) = I{A;D\C) + 
I{A-,B\CD) = I{A-,B\CD), since I{A-,D\C) = 0 by assumption. Combining both equations completes 
the proof. □ 


On the other hand, the following lemma asserts a condition under which conditioning decreases informa¬ 
tion: 

Lemma 2.7. Let A,B,C,D be jointly distributed random variables such that I{B; D\AC) = 0. Then it 
holds that I{A; B\C) > I{A; B\CD). 

Proof. Once again, we apply the chain rule twice. We have I{A-,B\CD) = I{AD; B\C) — I{D-,B\C) = 
/(A; B\C) -b /(D; B\AC) - I{D; B\C) = I{A- B\C) - I{D- B\C) < I{A-, B\C). □ 
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Fact 2.8 (Data processing inequality, general case). Let X ^ Y ^ Z be a Markov chain (I{X; Z\Y) = 0). 
Then I{X-Z) < I{X;Y). 

Fact 2.9 (Data processing inequality, special case). Let A, B, C he three jointly distributed random variables, 
where the domain of B is D, and let f : Ll —> lA be any deterministic function. Then L[A]B\C) > 
I[A-f{B)\C). 

Fact 2.10. Let yi and v be two probability distributions over a non-negative random variable X, whose value 
is bounded by Xmax- Then + \p.- i>\- Xmax- 

3 A hard distribution for r-round protocols 

We begin by defining a family of hard distributions for protocols with r rounds. Recall that Q{n,m) is 
the family of bipartite graphs on (n, m) vertex-sets. For any given number of rounds r, we define a hard 
distribution pr on bipartite graphs in Q{nr,mr). Pr is recursively defined in Figured] 


A recursive definition of the hard distribution pr 


In what follows, ^ is a parameter (denoting the bandwidth of the communication channel). 

1. For r = 0, = (t/°, E^) consists of a set of no bidders C/° = {bi ,..., and a set of mg 

items = {ji... ,jmo}^ such that uq = mg = £^. is then obtained by selecting a random 
permutation a Gr Sg5 and connecting {hi,ja(i)) by an edge. This specifies pg. 

2. For any r > 0, the distribution pr+i over = (17’'+^, i<;’'+^) is defined as follows: 

Vertices: 

• The set of bidders is [7’’+^ := (Jfci where \Bi\ = Ur- Thus, Ur+i = . 

• The set of items is := Tj where \Tj\ = mr. Thus, m-r-i-i = {nf 1 ■ n^) ■ mr- 

Edges: Let dr be the degree of each vertex (bidder) in the graph G'’ (this is well defined as, by 
induction, it holds that the degree of any vertex is fixed for every graph in the support of Pr). 
The distribution on edges is obtained by first choosing £ ■ random indices {oi, 02 ,... a^.„ 2 } from 
[nr-\-£- nr], and a random invertible map cr : [n^] —> [nf -\- £ ■ n^] \ {ai, 02 ,... Each 

bidder u € Bi is connected to dr random items in each one of the blocks Ta ^, Ta^, ■ ■ ■, Ta^ ^2 j using 
independent randomness for each of the blocks and for each bidder. The entire block Bi is further 
connected to the entire block T^ri^i) using an independent copy of the distribution pr- Note that this 
is well defined, as \Bi\ = nr, |To.(i)| = and Pr is indeed a distribution on bipartite graphs from 
Q{nr,mr). 


Figure 1: A hard distribution for r-round protocols. 

Remark 3.1. A few remarks are in order: 

(i) As standard, the input of each bidder u G is the set of incident edges on the vertex u (defined by 

Pr+i)- Note that every graph in the support of pr+i has a perfeet matching = nr+i). 

(ii) It is easy to see by induetion that : (a) nr = £^ ^ ; and (b) nvr < n^,. 

(Proof of (b): By induction on r, mr+i '■= {nf + £ ■ n() ■ mr < {nf £ ■ nf) ■ n^. < 2n® < n(._^_l). 

(Hi) Note that in Pr+i, each block of bidders Bi is connected to its “hidden item block” T^ri^i) using a copy 

of the joint distribution pr, and to each of the “fooling item blocks” Ta^, using the product of the 

marginals of pr, i.e., according to X„g„^(/ir|M). This property will be crucial. 
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(iv) Throughout the paper, we assume the bandwidth parameter i is larger than some large enough absolute 
constant (note that by (ii) above, in fact I = ujr{l)). 

Notation. To facilitate our analysis, the following notation will be useful. Notice that each block Bi of 
players is connected to exactly i ■ n(: + 1 blocks of items whose indices we denote by 

Ti . — } • 

For each Bi, let : Ii —> [£ ■ n(. + 1] be the bijection that maps any index in Ii to its location in the sorted 
list of Ii (i.e., is the smallest index in Ii, t~^(2) is the second smallest index in Ii and so forth). We 

henceforth denote by G* the (induced) subgraph of G = on the sets {Bi,T^-i^-s^), foreachj S [£-n(. + l]. 
By a slight abuse of notation, we will sometimes write G* = [Bi,T^-i^-,^) to denote the specific set of edges 
of G®. Similarly, for a bidder u G Bi, let G” = denote the (induced) subgraph of G on the sets 

{u, In this notation, the entire input of a player u € Bi is F^ := {G“, Gif,..., G^.^a+i}. Let 

Ji ■= Ti{a{i)) 

denote the index of the “hidden graph” Gj. = {Bi,T,y(^i)). To avoid confusion (with the other indices j), we 
henceforth write 

G{Ji) := GX- 

Note that by symmetry of our construction, the index Ji is uniformly distributed in +1]. The following 
fact will be crucial to our analysis: 

Fact 3.2 (Marginal Indistinguishability). For any bidder u G Bi, it holds that /(r„; Ji \ Ii) = 0. 

Proof. Recall that F^ = {G“, G 2 ,. ■., G“,„ 2 _|_i} is the input of bidder u. The claim follows directly from 
property (Hi) in Remark 13.11 since by definition of our construction, the distribution of edges of G“ = 
(u, Tj.-iq-^) is (pr\u) for all j G [£ ■ n^. + 1]. We remark that the above fact implies that, up to a permutation 
on the names of the items in G“ ^ Gf for any bidder u € Bi and any j ^ fc G [£ • + 1]. □ 

Finally, Let B denote the partition of bidders in U := into the blocks Bi, and T denote the partition 
of items in V := 1/”+^ into the blocks Tj. Throughout the proof, we think of T and B as fixed, while we 
think of the names of the bidders in each block of B and items in each block of T as random. Since T and 
B are fixed (publicly known) in the distribution p-r+i, our entire analysis is performed under the implicit 
conditioning on F,B. Note that T does not reveal the identity of the “fooling blocks” , but only the 
items belonging to each block. 


4 The lower bound 

In this section we prove our main result. Recall that the expected matching size of tt (with respect to /i) is 
E^[|AI(n) n E{G)\]. We shall prove the following theorem. 

Theorem 4.1 (Main Result). The expected matching size of any r-round protocol under Hr is at most 
5nr . This holds as long as the number of bits sent by each player at any round is at most £ = 

In particular, since pLr has a perfect matching, the approximation ratio of any r-round protocol is no better 
than n ^ . 

The intuition behind the proof is as follows. Consider some (r + l)-round protocol tt (with bandwidth £), 
and let Mb, = Mg Mg,,, Mff, denote the (concatenated) messages sent by all of the bidders in a block 
Bi in the first round of tt. From this point on, we will assume that tt is a deterministic protocol (since by 
the averaging principle we may fix its randomness without harming the performance). Informally speaking, 
the distribution p-r+i is designed so that messages of bidders in Bi (Mg,) convey little information about 
the “hidden” graph G(Ji). Intuitively, this will be true since the marginal distribution of the hidden graph 
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Gj. for any bidder u G Bi is indistinguishable from the rest of the “fooling graphs” (Fact 13.21) and therefore 

a bidder in Bi will not be able to distinguish between vertices (items) in '^a.j and in Ta-i^i). Using the 
conditional independence properties of the distribution /i^+i and the simultaneity of the protocol, we will 
show that the latter condition also implies that the total information conveyed by Mg. on G{Ji) is small. In 
order to make this information <C 1 bit, the parameters are chosen so that Ur grows doubly-exponentially in 
r (jir = ), and this choice is the cause for the approximation ratio we eventually obtain. Intuitively, the 

fact that little information is conveyed by each block on the “hidden graph” implies that the distribution of 
edges in the graph G{Ji) is still close to /Xr even conditioned on the first message of the i’th block Msi- Now 
suppose an (r + I)-round protocol finds a large matching with respect to the original distribution /Xr+i (in 
expectation). Then the expected induced matching size on G{Ji) must be large on average as well. Hence, 
“ignoring” the first round of the protocol, we would like to argue that the original protocol essentially induces 
an r-round protocol for finding a large matching with respect to the distribution /x^., up to some error term 
(indeed, some information about G{Ji) may have already been discovered in the first round of the protocol, 
but the argument above ensures that this information is small). Doing so essentially reduced the problem 
to finding a large matching under /x^. using only r rounds, so we may use an inductive approach to upper 
bound the latter expected matching size. 

Making the latter intuition precise is complicated by the fact that, unlike standard “round-elimination” 
arguments in the two-party setting, in our setup one cannot simply “project” an r-round n^+i-party protocol 
(with inputs ~ t>-r+i) directly to the distribution /x,., since a protocol for the latter distribution has only 
Ur players (inputs). To remedy this, we crucially rely on the conditional independence properties of our 
construction fLemma 14.61 belowi together with an embedding argument to obtain the desired lower bound. 

The embedding part of the proof fClaim lTTTl) is subtle, since in general, conditioning on the first message 
Ml correlates the (private) inputs of the players with the “missing” inputs to the “higher-dimensional” 
protocol (the “fooling item blocks” of ^J^r+l), so it is not clear how the players can sample these “missing” 
inputs without communicating. Luckily and crucially, the edges to the “fooling blocks” Ta^ in /Xr.+i were 
chosen independently for each bidder u G U (unlike the hidden graphs G{Ji) in which players have correlated 
edges). This independence is what allows to embed a lower-dimensional graph H ^ fir and “complete” the 
rest of the graph (using a combination of public and private randomness) according to the conditional 
distribution (G|Mi,iL) without any communication, thus “saving” one round of communication. 

We now turn to formalize the above intuition. From this point on, let us use the shorthands 

J . ,..., , T . Bi , X2,..., . 

Also, for the remainder of the proof, let us define for simplicity 



Ur 


Let TT be an (r -|- l)-round deterministic protocol. For a given message rnsj := ..., sent in tt 

by the bidders in block Bi in the first round of tt, a fixing of the index Ji = ji of the “hidden” block of items, 
and of the partition Xi , let 

:= (G(J,) I Mb, = = ji,X,) 

denote the distribution of the “hidden graph” G{Ji) conditioned on MBi,Xi and Ji. The following lemma 
asserts that, in expectation over the first communication round of tt, the marginal distribution of G{Ji) is 
very close to its original distribution /x^. 

Lemma 4.2. For every % G [nf], 

E -/xri] < Ay^. 

niB, 

Proof. We begin by showing that the local message Mf^, of any bidder u G Bi conveys little information on 
G{Ji). Note that Fact 13.21 land the Data Processing inequality fFact 12.91) i together imply that, for any block 
Bi of bidders and any u G Bi, 


I{M^^-,J,\Xi) = 0. 


( 2 ) 
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(Note that in contrast, Ji \ 1-i) ^ 0. In fact, Ji may be almost determined by the entire message of 

the i’th block (let alone by the entire message Mi of tt), as the induced distribution of G{Ji) is different 
than that of G*, j ^ Ji- This is where we crucially use the simultaneaty of bidder’s messages). We will also 
need the following proposition: 

Proposition 4.3. For any bidder u G Bi and any j G [£ • + 1], it holds that 

|J„ J, = j) < I{Ml;G] |I,)- 

Proof. Recall that r„ = {G“, G 2 , ■■■, G^.^ 2 _|_i} is the input of bidder u, and notice that for any j G [^-n^ + l], 

(M)), Ji = j) ^ (r„|i„ Ji = j) ^ (G“|J„ Ji = j) ^ (g; |j„ Ji = j) 

is a Markov chain where the left chain holds since, conditioned on (r„,2i, = j), is completely 

determined and therefore independent of G* and G“, and the right chain holds since conditioned on 2^, J, = j 
and the graph G“, the rest of the edges of the graph G] are independent of r„ by construction. Therefore, 
by the (general) Data Processing inequality (Fact I2.8|l , we have 

/(M)),; G} |2„ Ji = j) < I{M^^-,G^ |2„ J, = j). (3) 

Now, by Fact 13.21 we know that the distribution of (r„|Ii) is independent of the event “Ji = j”. Since Mg. 
and G“ are deterministic functions of r„ (conditioned on li), this also implies that the joint distribution of 
(Mg., G”|Ji) is independent of the event “Ji = j”. Therefore, we conclude by ([3]) that 

I {Ml ; G) \I„ J, = j) < I{Ml ; G“ |2„ J, = j) = I{Ml ; G“ |2,)- 


□ 


We proceed to prove the Lemma. We may now write for any u G Bi 


’■<+1 


I{Ml-G{Jf) |J„I.) = 


' + 1 


^ J(M]),; Gj |I.,J.=j) 


t=i 


(By definition of conditional mutual information and since Ji Gr [£ • + 1] and by ([3])) 

Gnl + l 


< 


1 


ni + 1 


/(Mg. ;G“|Ii) (By Proposition 1331) 


i=i 

'■rJ + l 


< 


-4—- E /(M]).;G“|G)‘,G^,...,G“_i,J.) 


1 


< 


/ • + 1 

/ • + 1 


■ /(M]),; G)*, G^,..., G^.„ 2 +i |2.) (by the chain rule) 


~ i-ni + 1 ~ 


(by Fact 12.4|) 

^ 1 a2 

- < — = 

9 I 1 ^ 9 

■ n^ + i ni. 


(4) 


(5) 


where the inequality in (jd]) follows from Lemma 12.61 taken with A = Gf,B = Mg.,G = Ii,D = G'fj, since 
G" is independent of G'^j for all j, conditioned on Ji. 

Now, we claim that, for each bidder u G Bi, conditioning on the previous messages of the bidders 
(Mg.“ := Mg.Mg. ... Mf^^) can only decrease the information Mg. reveals on the hidden graph G{Ji): 

Claim 4.4. I{Ml-G{Ji) \M<lJi,X,) < I{Ml-,G{J,) |J„J,). 









Proof. By construction of ^r+i, conditioned on G{Ji),Ii and Ji, the inputs of bidders u and Bi \ {u} are 
independent. In particular, this fact and the data processing inequality (Fact 12.91) together imply that 


since tt was assumed to be a deterministic protocol. By non-negativity of information and the chain rule, 

I{M^^-G{Ji) |M<“, 


□ 


We conclude that 

E [id (')/'(.II/Tr)] = I{MBr,G{Ji) \ Ji,Xi) (by Definition [231 of conditional mutual information) 

msi ,ji ,Ii 

= ^ I{MQ.;G{Ji) (by the chain rule) 

uGBi 

< Y, I(Mf,^;G{A) IJ„I,) (by ClaimSH) 

uGBi 

<\B,\-Al (by®) 

= Ur ■ Al = Ar. ( 6 ) 

Combining ®, Pinsker’s inequality iLemma 12.21) and convexity of completes the entire proof of the 
lemma. 


□ 


We are now ready to prove Theorem 14.11 To this end, for any r-round protocol tt, input graph G, and 
induced subgraph i? C G, let 

N^iG,H) := |7W(n(G))n£;(i7)| 

denote the size of the matching computed from tt’s transcript with respect to the subgraph H (note that 
Atn.(G, H) is a random variable depending on G). For notational convenience, we use the shorthand iV^(G) := 
Nt^{G,G). Theorem 14.II will follow directly from the following theorem: 

Theorem 4.5. Let tt be an r-round (deterministic) communication protocol with bandwidth 1. Then 

ac) + 1- 


E [A^^(G)] < hUr 

G'^fLr 


Proof. We prove the theorem by induction on r. Let us denote 


t{r) := bUr ■ 




^fc=0 


1 . 


For r = 0 (namely, with no communication at all), the expected number of edges the referee guesses correctly 
under /tq is at most no • ^ = 1 = t(0) (as G° ^ /xq is a random permutation on [no]). 

Suppose the theorem statement holds for all integers up to r. Thus, the expected matching produced by 
any r-round protocol 0 (with bandwidth < £) under satisfies 


E [fV,(G)] < t(r). 


(7) 
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We need to show that the expected matching produced by any (r + l)-round protocol tt (with bandwidth 
< £) under /i^+i satisfies 

E lN^(G)]<t(r + l). (8) 

G~/ir4-l 

Let TT be an (r + 1 (-round protocol. Recall that G ^ fJ,r+i consists of nf “blocks” Bi of bidders, each of 
which is connected to exactly \Xi\ =£• -|-1 item blocks. Let Mi := M^^Ms^ ... Ms^^ denote the messages 

sent by each block of bidders in the first round of tt (where Mb^ = M^.,M^.,... is the concatenated 

message of all bidders u G Bi). Recall that for every i G [nf], G{Ji) denotes the induced subgraph of G on 
(Ri, ^), and that for every bidder u G Bi, G'j^ = (w,T^-i(j.)) denotes the induced subgraph between 

bidder u and the “hidden graph” of the i’th block to which u belongs. In the same spirit, for every block Bi 
and every bidder u € Bi, let 

/ e-nl \ / i-nl \ 

G(r,) := f R,, U TaA , := L, |J 

denote the induced subgraph on the block Bi (on the bidder u G Bi) and all “fooling blocks” respectively. As 
usual, for any subset S C [nf], we write G(Ts) := (UieS Uj=r ^se the convention T := T[„4]. 

In what follows, G(J) := G(Ji)G(J 2 )... G(J„4) denotes the (concatenation of the) “hidden” graphs. The 
following proposition will be essential for the rest of our argument: 

Lemma 4.6 (Conditional Subgraph Decomposition). The following conditions hold: 

1 . ((G^,G^,...,GJ'-) |Mi,G(Ji),J,I) ^ X„gB^(G^|Mi,G}^,J,I), 
where {1,2,..., n^} are the bidders of the first block Bi. 

2. (G(J),G(T) I Ml, J,I) ^ X,g[„4](G(J,)G(r,) | A/b.,J,I). 

That is, the joint distribution of the ‘fooling subgraphs” Gif of each bidder u G Bi conditioned on the entire 
message Mi and the “hidden graph” G(Ji) of the first block, is a product of the marginal distributions Gf 
conditioned only on the “local hidden part” Gj^ of each bidder and Mi. 

Furthermore, the joint distribution of the subgraphs induced on each block (G{Ji)G{Ti)) conditioned on 
the entire message Mi is a product distribution of the marginal distributions of the i ’th block, conditioned 
only on the “local” message of the respective block Mb, (In particular, these graphs remain independent even 
conditioned on Mi). 

The intuition behind the second proposition is clear: Since in the original distribution p^r+i, the graphs 
of each block are independent by construction, this remains true even when conditioned on the first (de¬ 
terministic) message of each block. The first proposition is more subtle, since within the same block (say 
Bi), the inputs of the bidders u G Bi are correlated (via the hidden graph G(Ji)). However, conditioned on 
knowing the hidden block (J,I), the marginal distribution of the fooling graph Gf is independent for each 
u by construction, and therefore the only correlation between G(Ji) and Gf created by conditioning on the 
message Mi, is correlation between the “local hidden graph” of bidder u (G“^) and Gf. We remark that this 
fact will be used crucially in the embedding argument below IClaim 14771) . We proceed to the formal proof. 

Proof of Lemma \j.6[ We repeatedly use Lemma 12.71 

Proof of (1) It suffices to show that for every u G Bi, /(G^; G.)i“Gj“|Mi, J,I, Gj^) = 0. To this end, 
observe that 


/(G^;Mf“|Mi“,G(Ji),G{;“, J,J) < i7(Mf“|G(Ji),G{;“, J,J) = 0, (9) 

since the message Mj““ of all bidders in Bi except bidder u is fully determined by the inputs (G(Ji), Gy“). 
For the same reason, 

/(G{;“G7“;Mr|G}^,G^,J,I) < H{Mf\Gl,Gf,3,T)=0. (10) 
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Therefore, 


/(G^; G^“G7“|Mi, J,I, Gl) = /(G^;; G7“G7“|Mr, J,I, G}J 

< /(G7;G7“G7“|M“, J,I,G7J (By Lemmawith L> = Mf“, and (0) 

< /(G^.; G7“G7;|J,I, G7J (By Lemma O with D = M“, and (HU])) 

= 0, as desired. 

Proof of (2) It suffices to show /(G( Ji)G(Ti); G( J_i)G(T_i)MB_jMB., J,I) = 0. Once again, applying 
Lemma O with D = Mb^ , we have 


/(G(J,)G(T,);G(J_i)G(r_,)|MB,, J,I) < /(G(J,)G(r,);G(J_,)G(r_0|J,I) (11) 

since /(M^.; G( J_i)G(T_i)|G( J^), G(Ti), J,I) < iL(MBjG(Ji), G(Ti), J,I) = 0 where the last equality is 
because is determined by the input of block Bi. The same argument implies 


/(G(J0G(r,);MB_JMB,, J,I,G(J_0G(r_0) < /(G(J,)G(rO;G(J_,)G(T_0) (12) 

since once again,/(Ms-; Mb_JG(J), G(T), J,I) < iL(MBjG(J), G(T), J,I) =0. Combining equations (|TT]) 
and m, we conclude by the chain rule that 

I{G{J,)G{T,y, G( J_,)G(T_,)Mb_, \Mb, , 3,T) 

= /(G(J,)G(T,);G(J_i)G(T_,)|MB,,J,I)+/(G(J,)G(T,);MB_jMB,,J,I,G(J_,)G(T_i)) 

< I{G{Ji)G{Ti)- G(J_i)G(T_,)|J,I) + I{G{jy)G{Ti)-, Mb_JJ,I, G(J_,)G(T_i)) 

= 0, (13) 

where the last transition follows from the definition of /ir+i, and since is a deterministic function of 

G(J_i)G(T_i) conditioned on J,I. 

□ 


We now proceed to prove 0, the inductive step of the proof. Recall that tt is assumed to be deterministic, 
but Ml is still a random variable (under the input distribution ^br+i). Hence, we may equivalently draw 
G ~ /ir+i by first sampling the first message mi ~ Mi, and then sampling G ^ /rr+i|mi. Let us denote by 
7r|mi the protocol which is the subtree of tt conditioned on the first message being mi. Note that 7r|mi has 
only r rounds of communication. We therefore have 


E [A^7r(G)] < E E 

G-Mr+I G|mi.J,I 


ril ■ rUr + ^ {G, G{Ji)) 

i=l 


(14) 


since any matching in G can at most match all of the items in Ui=i Taj and each block Taj contains m^ 
edges by definition of ^r+i, and the rest of the matched edges are contained in G( Ji), G( J 2 ),... ,G(J„4). 
Recall that by definition of /ir+i, G{ Ji) ^ fXr- Hence by linearity of expectation and the second proposition 
of Lemma 14.61 we may equivalently write the above as 


= i ■ n„ ■ m. 


+ V E 

mi 


^71 G(Ji)|(mB,.J,X) 

’ G| (G(Ji).mi.J,I) 


= £■ ■ nir + E E 

’ G| (G(Ji).mi.J.X) 


E [iV^|„,(G,G(J,))] 

[iV^|™,(G,G(J,))] , 


(15) 
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by definition of ipl (actually, V'r is defined conditioned only on Ji,Ii but conditioning on all indices 3,1 
clearly doesn’t change the distribution). By symmetry of the distribution ^r+i, it suffices to upper bound 
the first term in the above summation 

E E [N^\mAG,GiJi))]. 

’ G| (G(Ji),mi,J,X) 


To this end, we have 


E E 

G(Ji)-y-) 

' G| (G(Ji),mi,J.I) 

< E [\tpl - Hr\] ■ n. 

mi 

J,X 


< AIG . Ur + 


[iV,|,^,(G,G(Ji))] 

[iV^I„,(G,G(Ji))] 
[lV^I„,(G,G(Ji))] 


■ E 

mi 

J,X 


E 
G( 

G| (G(Ji),mi,J.I) 


E E 

■'’-^Gl (G(Ji),mi,J,I) 


(16) 

(17) 


where follows from fact 12.101 since trivially A^ 7 r|mi (G, G( Ji) < |G’'| = Ur for any G(Ji), and the last 
transition (II3 follows from Lemma We now wish to use the inductive hypothesis to argue that the 
rightmost term of (El) cannot exceed the expected matching size of an r-round protocol over jjLj.- We do so 
using an embedding argument, which is the heart of the proof. 

Claim 4.7 (r-round Embedding). 

E E [lV^|„,(G,G(Ji))] <t(r). 

71 G(Ji)-a*. 

(G(Ji).mi.J,I) 


Proof. Notice that the protocol 7r|mi is defined over inputs from /r^+i and not fir, so we cannot apply the 
inductive hypothesis directly to obtain our desired upper bound. Instead, we will “embed” H ^ fir into 
TT\mi by simulating the rest of the players using public randomness (and then fix the public coins to obtain 
a deterministic protocol). To this end, for every bidder u € t/”, denote by Hu the induced subgraph of H 
on the vertex u (i.e., the input of bidder u in fir). 

Consider the following r-round randomized protocol r for H ^ fir- The rir players use the shared random 
tape to sample {Mi, 3,1) (according to the probability space of tt). Then, they “embed” their inputs (the 
graph H) to the first block Bi and each bidder u £ Bi “completes” his missing edges to the “fooling 
graph” G^ according to (G^|Mi, iJ^i, J,I) using private randomness. Note that this is possible due to the 
first proposition of Lemma 14.61 since it asserts that (G^|Mi, 77, J,I) ^ (G^|Mi, i7„, J,I). The players 
now use the second proposition of Lemma 14.61 to sample the graphs of the rest of the blocks according to 
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{Ji)G{Ti) I MBi,3,X), as the proposition asserts that these subgraphs remain independent after the 
conditioning. 

This process specifies a graph G such that H fir and G ~ fir+i conditioned on G(Ji) = H,Mi,3,X. 
Notice that so far the players have not communicated at all. The players now run the r-round protocol 7r|mi 
and outputs its induced matching on H. Notice that this protocol is well defined, as the messages of bidders 
outside block Bi in 7r|mi in every round (r £ {2,3,..., r -I-1}) are completely determined by their respective 
inputs on the random tape and the content of the blackboard, since tt was assumed to be a deterministic 
protocol. Call the resulting protocol r. 

By construction, the expected matching size of r (over the private and public randomness Mi, J,I, G) 
with respect to H is 


E E 

^Xg\ (G(Ji)=ff.mi,J.I) 


[iV,|„,(G,M)] = E 

j,i 


E 

G\ (G(Ji).mi.J.I) 


[iV,|„,(G,G(Ji))] . 


By the averaging principle, there is some fixing of the randomness of r that obtains (at least) the same 
expectation as above with respect to H ^ fir. Call this deterministic protocol r'. But t' is a deterministic 
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r-round protocol over hence the inductive hypothesis asserts that 


E E [7V^|„,(G,G(Ji))] < E [Nr>{H)] < t(r), 

G| (G(Ji),mi,J,Z) 


as claimed. 

□ 

We are now in shape to complete the entire proof of Theorem 14.51 Plugging in the bounds of (ED and 
Claim ITT tI into equation (ITSl) . we have 


E [A^,r(G)] < £ ■ ■ nir + n'^ ■ • rir + t{r) 


= £ ■ nl ■ TTir + nf. 


• Ur + 5nr ■ ( ^ A^^ j + 1 


\k=0 


< 


/ r — 1 


brir ■ Ay^ + BUr ■ Ea 


1/2 

k 


Kk^O 


(since £ ■ ny rur +n‘^ < 4n® Aj/^) 


\k=0 


= 5n® • I^E J = 5nr+i ■ | ^ A^^ j (since by definition, Ur+i = 

= t(r + 1) - 1 
< t{r + 1). 


This proves the induction step ([SD , and therefore concludes the entire proof of Theorem 14.5 


(18) 

□ 


Theorem 14.51 immediately implies our desired lower bound: 

Proof of Theorem By Theorem 14.51 the expected matching size of any r-round protocol tt under is 
at most 


E [A^(G)] < t{r) = Bur ■ 

G'^ fj’v 



+1 



since by definition of /ir, Ur = and no := £^, hence Ur = . Since this is a doubly-exponential 

decaying series (and as long as £ is large enough than some absolute constant), we can upper bound the sum 
by, say. 


T/5 


< Bur ■ Ag^ =Bnr- 


1/5 


= J ■ Ur = B ■ 


l/S- 




since, by property (4) in Remark 13.11 Ur = £^ 


£ = Ur 


□ 
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